On the elementary symmetric functions of a sum of matrices 



R. S. Costas-Santos* 



September 17, 2009 



Abstract 



Often in mathematics it is useful to summarize a multivariate phenomenon with a single
number. The determinant, denoted by $\det$, is one of the simplest cases, and
many of its properties are very well known. For instance, the determinant is a multiplicative
function, i.e. $\det(AB) = \det A \cdot \det B$ for $A, B \in M_n$, and it is a multilinear function, but it is
not, in general, an additive function, i.e. $\det(A + B) \neq \det A + \det B$.

Another interesting scalar function in matrix analysis is the characteristic polynomial.
Given a square matrix $A$, the coefficients of its characteristic polynomial
$\chi_A(t) := \det(tI - A)$ are, up to a sign, the elementary symmetric functions associated with
the eigenvalues of $A$.

In the present paper we present new expressions related to the elementary symmetric
functions of a sum of matrices.

The main motivation of this manuscript is to find new properties that may help to prove the
following conjecture.

Bessis-Moussa-Villani conjecture: [2, 4] 

The polynomial $p(t) := \mathrm{Tr}((A + tB)^m) \in \mathbb{R}[t]$ has only nonnegative coefficients whenever
$A, B \in M_r$ are positive semidefinite matrices.

Moreover, some numerical evidence and the Newton-Girard formulas suggested that we
consider a more general conjecture, which will be studied in a further manuscript.

Positivity Conjecture:
The polynomial $S_k((A + tB)^m) \in \mathbb{R}[t]$ has only nonnegative coefficients whenever $A, B \in M_r$
are positive semidefinite matrices, for every $k = 0, 1, \dots, r$.

It is clear that the BMV conjecture is the particular case $k = 1$ of the positivity conjecture,
since $S_1 = \mathrm{Tr}$.

1 Introduction

Denote by $M_{m,n}$ the set of $m \times n$ matrices over an arbitrary field $\mathbb{F}$ and by $M_n$ the set $M_{n,n}$.
Determinants are mathematical objects that are very useful in matrix analysis.

In fact, the determinant of a matrix $A \in M_n$ can be presented in two important, apparently
different, but equivalent ways.

The first one is the Laplace expansion:
If $A = [a_{i,j}]$ and assuming that the determinant is defined over $M_{n-1}$, then

$$\det(A) = \sum_{j=1}^{n} (-1)^{i+j} a_{i,j} \det(A_{ij}) = \sum_{i=1}^{n} (-1)^{i+j} a_{i,j} \det(A_{ij}), \tag{1}$$

where $A_{ij} \in M_{n-1}$ denotes the submatrix of $A$ resulting from the deletion of row $i$ and column $j$.

The second way is the alternating sum:

$$\det(A) = \sum_{\sigma \in P_n} \mathrm{sgn}(\sigma)\, a_{1,\sigma(1)} a_{2,\sigma(2)} \cdots a_{n,\sigma(n)}, \tag{2}$$

where $P_n$ is the set of all permutations of $\{1, 2, \dots, n\}$, and $\mathrm{sgn}(\sigma)$ denotes the sign of the
permutation $\sigma$.
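As a small illustration of the equivalence of the two definitions, the following Python sketch evaluates the Laplace expansion (along the first row) and the alternating sum over permutations on one integer matrix. The function names `det_laplace`, `det_permutations` and `sign`, as well as the sample matrix, are illustrative choices and do not come from the paper.

```python
from itertools import permutations


def det_laplace(M):
    """Determinant by Laplace expansion along the first row."""
    n = len(M)
    if n == 0:
        return 1  # determinant of the empty matrix
    total = 0
    for j in range(n):
        # Minor obtained by deleting row 0 and column j.
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * M[0][j] * det_laplace(minor)
    return total


def sign(perm):
    """Sign of a permutation of 0..n-1, computed from its inversion count."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def det_permutations(M):
    """Determinant as the alternating sum over all permutations."""
    n = len(M)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for i in range(n):
            term *= M[i][perm[i]]
        total += term
    return total


A = [[2, -1, 0], [1, 3, 4], [0, 5, -2]]  # arbitrary sample matrix
print(det_laplace(A), det_permutations(A))
```

Both functions use exact integer arithmetic, so the two printed values can be compared directly.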

Remark 1.1 Notice that with these definitions it is clear that the determinant is a multilinear 
function. 

In the present paper we present a closed expression for

$$\det(A_1 + A_2 + \cdots + A_N), \qquad A_1, A_2, \dots, A_N \in M_n,$$

where $N \ge n + 1$, in terms of sums of other determinants involving the matrices $A_1, A_2, \dots, A_N$.

Definition 1.2 Let $A \in M_{m,n}$. For any index sets $\alpha$, $\beta$, with $\alpha \subseteq \{1, \dots, m\}$, $\beta \subseteq \{1, \dots, n\}$,
and $|\alpha| = |\beta|$, we denote the submatrix that lies in the rows of $A$ indexed by $\alpha$ and the columns
indexed by $\beta$ as $A(\alpha, \beta)$.



For example,

$$\begin{bmatrix} 1 & 9 & -2 \\ 2 & 17 & 11 \end{bmatrix}(\{1\}, \{1, 3\}) = \begin{bmatrix} 1 & -2 \end{bmatrix}.$$



On the other hand, taking into account some properties of the determinant, it is well
known that the characteristic polynomial of a given square matrix $A$ can be written as

$$\chi_A(t) = \det(tI - A) = t^n - S_1(A)\, t^{n-1} + \cdots + (-1)^n S_n(A),$$

where $I \in M_n$ is the identity matrix, and $S_k(A)$ is the $k$-th elementary symmetric function associated with the
matrix $A$, $k = 1, 2, \dots, n$.

In fact, by the second definition of the determinant, i.e. the alternating sum, it
is straightforward to see that

$$S_k(A) = \sum_{|\alpha| = k} \det(A(\alpha, \alpha)), \qquad k = 1, 2, \dots, n. \tag{3}$$
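The description of $S_k(A)$ as a sum of principal minors can be checked numerically. The sketch below is only an illustration: the helper names `det`, `elementary_symmetric` and `char_poly_at` and the sample matrix are assumptions made here. It computes every $S_k(A)$ from principal minors and compares $\det(tI - A)$ with $t^n - S_1(A) t^{n-1} + \cdots + (-1)^n S_n(A)$ at several integer values of $t$.

```python
from itertools import combinations


def det(M):
    """Determinant by Laplace expansion along the first row (exact for integers)."""
    if not M:
        return 1
    return sum(
        (-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
        for j in range(len(M))
    )


def elementary_symmetric(A, k):
    """S_k(A): the sum of all k-by-k principal minors of A (S_0 = 1)."""
    n = len(A)
    return sum(
        det([[A[i][j] for j in idx] for i in idx])
        for idx in combinations(range(n), k)
    )


def char_poly_at(A, t):
    """Evaluate det(t*I - A) at the number t."""
    n = len(A)
    return det([[(t if i == j else 0) - A[i][j] for j in range(n)] for i in range(n)])


A = [[2, -1, 0], [1, 3, 4], [0, 5, -2]]  # arbitrary sample matrix
n = len(A)
S = [elementary_symmetric(A, k) for k in range(n + 1)]

for t in range(-3, 4):
    expansion = sum((-1) ** k * S[k] * t ** (n - k) for k in range(n + 1))
    assert char_poly_at(A, t) == expansion

print(S)
```

Since a polynomial of degree $n$ is determined by $n + 1$ values, agreement at the seven sample points already forces the two polynomials to coincide for this matrix.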

In connection with the elementary symmetric functions we present new equalities related to
these functions, giving explicit expressions for $S_2(A + B)$ and $S_3(A + B)$, for any $A, B \in M_n$.

The structure of this paper is the following: In Section 2 we present some results related to
the determinant of a sum of matrices, whose proof is given in the appendix. In Section 3 we obtain
the values of $S_2(A + B)$ and $S_3(A + B)$ by using the definition of the elementary symmetric
functions of a matrix. In Section 4 we prove the same identities, and also obtain $S_4(A + B)$,
by using the Newton-Girard identities, where $A$ and $B$ are two generic $n$-by-$n$ matrices.

2 The determinant of a sum of matrices




Let $N$ be a positive integer and let us consider the $N$-tuple of $n$-by-$n$ matrices

$$S := (A_1, A_2, \dots, A_N).$$

We define $\Sigma(S)$ as the set of all possible formal sums of matrices of $S$ where each $A_i$, $i = 1, \dots, N$,
appears at most once.

Remark 2.1 Note that, without loss of generality, we can include the null matrix, $0$, in $\Sigma(S)$.

The following result will be useful in what follows.

Theorem 2.2 Let $A \in M_n$ and let $N$ be an integer with $N \ge n + 1$. For any $N$-tuple $S =
(A_1, A_2, \dots, A_N)$, $A_i \in M_n$, $i = 1, \dots, N$, the following relation holds:

$$\sum_{k=0}^{N} (-1)^k \sum_{\substack{\Omega \in \Sigma(S) \\ |\Omega| = k}} \det\Bigl(A + \sum_{A_i \in \Omega} A_i\Bigr) = 0, \tag{4}$$

where $|\Omega| = k$ means that $\Omega$ is a formal sum with $k$ summands, and $A_i \in \Omega$
means that $A_i$ is a summand in $\Omega$.

Remark 2.3 The identity (4) can be rewritten as

$$\sum_{x_1, \dots, x_N = 0}^{1} (-1)^{x_1 + \cdots + x_N} \det\Bigl(A + \sum_{j=1}^{N} x_j A_j\Bigr) = 0. \tag{5}$$

Chapman proves in [7] the case $A = 0$ of this, but his argument works as well in this generalized
form: the determinant is a polynomial of degree less than $N$ in the variables $x_1, \dots, x_N$, and
this alternating sum must vanish, as seen by applying it to any monomial of degree less than $N$.
Alternatively, (5) follows by subtracting the $N + 1$ case of Chapman's identity from the $N$ case.

For instance, if we set $A = 0$ in (4) and $A_1, A_2, A_3, A_4 \in M_3$, i.e. $N = 4$, then

$$\begin{aligned}
\det(A_1 + A_2 + A_3 + A_4) ={}& \det(A_1 + A_2 + A_3) + \det(A_1 + A_2 + A_4) \\
&+ \det(A_1 + A_3 + A_4) + \det(A_2 + A_3 + A_4) \\
&- \det(A_1 + A_2) - \det(A_1 + A_3) - \det(A_1 + A_4) \\
&- \det(A_2 + A_3) - \det(A_2 + A_4) - \det(A_3 + A_4) \\
&+ \det(A_1) + \det(A_2) + \det(A_3) + \det(A_4).
\end{aligned}$$
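The identity (4) lends itself to a direct numerical check. The following sketch, in which the helper names, the seed and the entry range are arbitrary illustrative choices, evaluates the alternating sum over all sub-collections of randomly chosen integer $3 \times 3$ matrices, once with $N = 4$ (inside the range of Theorem 2.2) and once with $N = 3$ (outside it).

```python
import random
from itertools import combinations


def det(M):
    """Determinant by Laplace expansion along the first row (exact for integers)."""
    if not M:
        return 1
    return sum(
        (-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
        for j in range(len(M))
    )


def add(X, Y):
    """Entrywise sum of two matrices of the same size."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def alternating_sum(A, mats):
    """Alternating sum of det(A + sum of a sub-collection) over all sub-collections."""
    total = 0
    for k in range(len(mats) + 1):
        for subset in combinations(mats, k):
            M = A
            for X in subset:
                M = add(M, X)
            total += (-1) ** k * det(M)
    return total


def random_matrix(n, rng):
    return [[rng.randint(-5, 5) for _ in range(n)] for _ in range(n)]


rng = random.Random(0)  # fixed seed, so the run is reproducible
n = 3
A = random_matrix(n, rng)
mats4 = [random_matrix(n, rng) for _ in range(4)]

print(alternating_sum(A, mats4))      # N = 4 >= n + 1: the theorem applies
print(alternating_sum(A, mats4[:3]))  # N = 3 = n: the theorem does not apply
```

The first value vanishes, as the theorem predicts; the second one is not constrained by the theorem.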

This result has very interesting consequences. 

Corollary 2.4 Under the conditions of Theorem 2.2, for any index sets $\alpha, \beta \subseteq \{1, 2, \dots, n\}$
of size $r$, with $N \ge r + 1$, the following relation holds:

$$\sum_{k=0}^{N} (-1)^k \sum_{|\Omega| = k} \det\Bigl(A(\alpha, \beta) + \sum_{A_i \in \Omega} A_i(\alpha, \beta)\Bigr) = 0. \tag{6}$$

The proof follows from Theorem 2.2, replacing $A$ by $A(\alpha, \beta)$ and taking into account that
$A_i(\alpha, \beta) \in M_r$ and $N \ge r + 1$.

On the other hand, if we combine the above result and (3) we obtain the following.

Corollary 2.5 Under the conditions of Theorem 2.2, for any nonnegative integer $r$ with $N \ge r + 1$,

$$\sum_{k=0}^{N} (-1)^k \sum_{|\Omega| = k} S_r\Bigl(A + \sum_{A_i \in \Omega} A_i\Bigr) = 0, \tag{7}$$

where $S_r(C)$ is the $r$-th elementary symmetric function of the matrix $C$.

The proof, again, is straightforward taking into account (3) and that $N \ge r + 1$.

The following identity is useful to compute the $r$-th elementary symmetric function of a sum
of any number $N \ge r + 1$ of matrices.

Corollary 2.6 Under the conditions of Theorem 2.2, for any nonnegative integers $r$ and $N$ with $N \ge r + 1$,
the following identity holds:

$$S_r(A_1 + A_2 + \cdots + A_N) = \sum_{j=0}^{r} (-1)^j \binom{j + N - r - 1}{N - r - 1} \sum_{|\Omega| = r - j} S_r\Bigl(\sum_{A_i \in \Omega} A_i\Bigr), \tag{8}$$

whose proof is elementary and is left to the reader.
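A quick numerical check of (8) can be sketched as follows. The code assumes that the outer index $j$ runs from $0$ to $r$ and that the empty formal sum is the zero matrix; the helper names and the random test data are illustrative only.

```python
import random
from itertools import combinations
from math import comb


def det(M):
    """Determinant by Laplace expansion along the first row (exact for integers)."""
    if not M:
        return 1
    return sum(
        (-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
        for j in range(len(M))
    )


def elementary_symmetric(A, k):
    """S_k(A): the sum of all k-by-k principal minors of A."""
    n = len(A)
    return sum(
        det([[A[i][j] for j in idx] for i in idx])
        for idx in combinations(range(n), k)
    )


def mat_sum(mats, n):
    """Sum of a (possibly empty) collection of n-by-n matrices."""
    total = [[0] * n for _ in range(n)]
    for X in mats:
        total = [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(total, X)]
    return total


def rhs_of_identity(mats, r, n):
    """Right-hand side of (8), with j assumed to range over 0..r."""
    N = len(mats)
    total = 0
    for j in range(r + 1):
        inner = sum(
            elementary_symmetric(mat_sum(subset, n), r)
            for subset in combinations(mats, r - j)
        )
        total += (-1) ** j * comb(j + N - r - 1, N - r - 1) * inner
    return total


rng = random.Random(3)  # fixed seed, so the run is reproducible
n, N, r = 3, 4, 2
mats = [[[rng.randint(-4, 4) for _ in range(n)] for _ in range(n)] for _ in range(N)]
lhs = elementary_symmetric(mat_sum(mats, n), r)
print(lhs == rhs_of_identity(mats, r, n))
```

For $N = 4$ and $r = 3$ all binomial coefficients equal $1$, and the identity reduces to the expansion of $\det(A_1 + A_2 + A_3 + A_4)$ displayed after Remark 2.3.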

In fact, Theorem 2.2 is optimal with respect to the range of $N$, i.e. for every positive integer
$n$ it is possible to find $n$-tuples of matrices of $M_n$ such that the equality (4), given in Theorem 2.2, fails.
For instance, taking

$$A_i = \mathrm{diag}(e_i), \quad i = 1, 2, \dots, n, \qquad A = x\, \mathrm{diag}(e_1),$$

where $\{e_1, e_2, \dots, e_n\}$ is the canonical basis of $\mathbb{R}^n$, it is straightforward to check that

$$\sum_{k=0}^{n} (-1)^k \sum_{\substack{\Omega \in \Sigma(S) \\ |\Omega| = k}} \det\Bigl(A + \sum_{A_i \in \Omega} A_i\Bigr) = (-1)^n (1 + x - x) \neq 0.$$
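The counterexample can be reproduced with a few lines of Python. In this sketch $A$ is taken to be $x$ times the diagonal matrix $\mathrm{diag}(e_1)$, the value $x = 7$ is arbitrary, and the helper names are illustrative; the alternating sum is evaluated for $N = n$ and $n = 1, \dots, 4$.

```python
from itertools import combinations


def det(M):
    """Determinant by Laplace expansion along the first row (exact for integers)."""
    if not M:
        return 1
    return sum(
        (-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
        for j in range(len(M))
    )


def unit_diagonal(n, i):
    """The n-by-n matrix diag(e_i): a single 1 at position (i, i)."""
    return [[1 if r == c == i else 0 for c in range(n)] for r in range(n)]


def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def alternating_sum(A, mats):
    """Alternating sum of det(A + sum of a sub-collection) over all sub-collections."""
    total = 0
    for k in range(len(mats) + 1):
        for subset in combinations(mats, k):
            M = A
            for X in subset:
                M = add(M, X)
            total += (-1) ** k * det(M)
    return total


x = 7  # arbitrary scalar
values = []
for n in range(1, 5):
    mats = [unit_diagonal(n, i) for i in range(n)]               # N = n matrices
    A = [[x * entry for entry in row] for row in unit_diagonal(n, 0)]
    values.append(alternating_sum(A, mats))

print(values)
```

Only the sub-collections containing $A_2, \dots, A_n$ give a nonzero determinant, which is why the sum collapses to $(-1)^n(1 + x - x)$.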

3 Obtaining $S_2(A + B)$ and $S_3(A + B)$

So the next logical step is to get closed expressions for the $r$-th elementary symmetric functions
of a sum of $N$ matrices, with $1 < N \le r$. To do that we will use the Newton-Girard formulas
for the elementary symmetric functions (see e.g. [5, §10.12]) and the definition of such functions
(3).

Remark 3.1 Note that if $A$ is $n$-by-$n$, then $\det(A) = S_n(A)$, so it is enough to obtain those
identities for the elementary symmetric functions and then apply them to the determinant.

Lemma 3.2 For any $A_1, A_2, A_3 \in M_n$, we get

$$S_2(A_1 + A_2) = S_2(A_1) + S_2(A_2) + S_1(A_1) S_1(A_2) - S_1(A_1 A_2), \tag{9}$$

$$\begin{aligned}
S_3(A_1 + A_2) ={}& S_3(A_1) + S_3(A_2) - S_1(A_1 + A_2)\, S_1(A_1 A_2) \\
&+ S_1(A_1) S_2(A_2) + S_1(A_2) S_2(A_1) + S_1(A_1^2 A_2) + S_1(A_1 A_2^2),
\end{aligned} \tag{10}$$

$$\begin{aligned}
S_3(A_1 + A_2 + A_3) ={}& S_3(A_1) + S_3(A_2) + S_3(A_3) - S_1(A_1 + A_2 + A_3)\, S_1(A_1 A_2 + A_1 A_3 + A_2 A_3) \\
&+ S_1(A_1)\bigl(S_2(A_2) + S_2(A_3)\bigr) + S_1(A_2)\bigl(S_2(A_1) + S_2(A_3)\bigr) + S_1(A_3)\bigl(S_2(A_1) + S_2(A_2)\bigr) \\
&+ S_1(A_1) S_1(A_2) S_1(A_3) \\
&+ S_1(A_1^2 A_2) + S_1(A_1 A_2^2) + S_1(A_1^2 A_3) + S_1(A_1 A_3^2) + S_1(A_2^2 A_3) + S_1(A_2 A_3^2) \\
&+ S_1(A_1 A_2 A_3) + S_1(A_1 A_3 A_2).
\end{aligned} \tag{11}$$

Proof: Let $A_1, A_2 \in M_n$ be two matrices with spectra $\sigma(A_1) = \{\lambda_1, \dots, \lambda_n\}$ and $\sigma(A_2) =
\{\mu_1, \dots, \mu_n\}$, respectively. The Newton-Girard formula gives

$$\sum_{i=1}^{n} \lambda_i^2 = S_1^2(A_1) - 2 S_2(A_1),$$

where $S_1(A_1) = \mathrm{Tr}(A_1)$. Without loss of generality we can assume $A_1$ diagonal; then, writing $A_2 = [b_{i,j}]$, by definition of $S_2$ (see (3))
we get

$$\begin{aligned}
S_2(A_1 + A_2) &= \sum_{1 \le i < j \le n} \lambda_i \lambda_j + \sum_{j=1}^{n} \lambda_j \bigl(\mathrm{Tr}(A_2) - b_{j,j}\bigr) + S_2(A_2) \\
&= \frac{1}{2}\Bigl(\sum_{j=1}^{n} \lambda_j\Bigr)^2 - \frac{1}{2} \sum_{j=1}^{n} \lambda_j^2 + \mathrm{Tr}(A_1)\mathrm{Tr}(A_2) - \mathrm{Tr}(A_1 A_2) + S_2(A_2) \\
&= \frac{1}{2} (\mathrm{Tr}(A_1))^2 - \frac{1}{2}\bigl((\mathrm{Tr}(A_1))^2 - 2 S_2(A_1)\bigr) + \mathrm{Tr}(A_1)\mathrm{Tr}(A_2) - \mathrm{Tr}(A_1 A_2) + S_2(A_2) \\
&= S_2(A_1) + S_2(A_2) + \mathrm{Tr}(A_1)\mathrm{Tr}(A_2) - \mathrm{Tr}(A_1 A_2).
\end{aligned}$$

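For $S_3(A_1 + A_2)$, if $A_1$ is a diagonal matrix, by definition of $S_3$ (see (3)) it is straightforward
to get

$$\begin{aligned}
S_3(A_1 + A_2) ={}& \sum_{1 \le i < j < k \le n} \lambda_i \lambda_j \lambda_k + \sum_{1 \le i < j \le n} \lambda_i \lambda_j \bigl(\mathrm{Tr}(A_2) - b_{i,i} - b_{j,j}\bigr) \\
&+ \sum_{j=1}^{n} \lambda_j S_2((A_2)_{jj}) + S_3(A_2),
\end{aligned}$$

where $b_{i,j}$ are the entries of $A_2$ and $(A_2)_{jj}$ is the submatrix of $A_2$ obtained by deleting row $j$ and column $j$.
Taking into account that in this case the Newton-Girard formula produces the identity

$$\sum_{i=1}^{n} \lambda_i^3 = S_1^3(A_1) - 3 S_1(A_1) S_2(A_1) + 3 S_3(A_1),$$

and the expansion of $(a + b + c)^3$, we obtain

$$\begin{aligned}
S_3(A_1 + A_2) ={}& \frac{1}{6}\bigl((\mathrm{Tr}(A_1))^3 + 2\,\mathrm{Tr}(A_1^3) - 3\,\mathrm{Tr}(A_1^2)\,\mathrm{Tr}(A_1)\bigr) \\
&+ S_2(A_1)\mathrm{Tr}(A_2) - \mathrm{Tr}(A_1)\mathrm{Tr}(A_1 A_2) + \mathrm{Tr}(A_1^2 A_2) \\
&+ \sum_{j=1}^{n} \lambda_j S_2((A_2)_{jj}) + S_3(A_2).
\end{aligned}$$

But we can assume $A_2$ is a diagonal matrix and write

$$S_2((A_2)_{jj}) = S_2(A_2) - \mu_j \mathrm{Tr}(A_2) + \mu_j^2, \qquad j = 1, 2, \dots, n.$$

So,

$$\begin{aligned}
S_3(A_1 + A_2) ={}& S_3(A_1) + S_2(A_1)\mathrm{Tr}(A_2) - \mathrm{Tr}(A_1)\mathrm{Tr}(A_1 A_2) \\
&+ \mathrm{Tr}(A_1^2 A_2) + \mathrm{Tr}(A_1) S_2(A_2) - \mathrm{Tr}(A_1 A_2)\mathrm{Tr}(A_2) \\
&+ \mathrm{Tr}(A_1 A_2^2) + S_3(A_2),
\end{aligned}$$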

and hence both relations, (9) and (10), hold. Moreover, (11) is a direct consequence of (10). □

4 Another way to obtain $S_2(A + B)$, $S_3(A + B)$ and $S_4(A + B)$

We start by proving the identity for $S_2(A + B)$ using the Newton-Girard identities:

$$\begin{aligned}
-2 S_2(A + B) &= S_1((A + B)^2) - S_1^2(A + B) \\
&= -S_1^2(A) + S_1(A^2) - S_1^2(B) + S_1(B^2) - 2 S_1(A) S_1(B) + 2 S_1(AB) \\
&= -2 S_2(A) - 2 S_2(B) - 2 S_1(A) S_1(B) + 2 S_1(AB).
\end{aligned}$$
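The computation above can be mirrored numerically. The following sketch is illustrative only, with assumed helper names and random integer test matrices: it computes $S_2$ both from principal minors and from the Newton-Girard relation $2 S_2(A) = \mathrm{Tr}(A)^2 - \mathrm{Tr}(A^2)$, and then tests the identity (9) for $S_2(A + B)$.

```python
import random
from itertools import combinations


def det(M):
    """Determinant by Laplace expansion along the first row (exact for integers)."""
    if not M:
        return 1
    return sum(
        (-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
        for j in range(len(M))
    )


def elementary_symmetric(A, k):
    """S_k(A): the sum of all k-by-k principal minors of A."""
    n = len(A)
    return sum(
        det([[A[i][j] for j in idx] for i in idx])
        for idx in combinations(range(n), k)
    )


def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def trace(X):
    return sum(X[i][i] for i in range(len(X)))


def s2_from_traces(A):
    """S_2(A) via Newton-Girard; the division is exact for integer matrices."""
    return (trace(A) ** 2 - trace(matmul(A, A))) // 2


rng = random.Random(1)  # fixed seed, so the run is reproducible
n = 4
A = [[rng.randint(-4, 4) for _ in range(n)] for _ in range(n)]
B = [[rng.randint(-4, 4) for _ in range(n)] for _ in range(n)]

lhs = elementary_symmetric(add(A, B), 2)
rhs = (elementary_symmetric(A, 2) + elementary_symmetric(B, 2)
       + trace(A) * trace(B) - trace(matmul(A, B)))
print(lhs == rhs, s2_from_traces(A) == elementary_symmetric(A, 2))
```

Note that the matrices here are neither diagonal nor commuting, so the check does not rely on the simplifying assumptions used in the proof of Lemma 3.2.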

We apply an analogous procedure to obtain $S_3(A + B)$:

$$\begin{aligned}
3 S_3(A + B) ={}& S_1((A + B)^3) - S_1((A + B)^2)\, S_1(A + B) + S_1(A + B)\, S_2(A + B) \\
={}& S_1(A^3) + 3 S_1(A^2 B) + 3 S_1(A B^2) + S_1(B^3) - S_1(A^2) S_1(A) \\
&- S_1(A^2) S_1(B) - 2 S_1(AB) S_1(A) - 2 S_1(AB) S_1(B) \\
&- S_1(B^2) S_1(A) - S_1(B^2) S_1(B) + S_1(A + B)\, S_2(A + B).
\end{aligned}$$

If we now expand $S_2(A + B)$, after some simplifications we get the desired identity for
$S_3(A + B)$.
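The Newton-Girard recursion used in this section, $k\,S_k = \sum_{j=1}^{k} (-1)^{j-1} S_{k-j}\, \mathrm{Tr}(A^j)$, is easy to implement. The sketch below, whose function names and test data are illustrative assumptions, computes $S_0, \dots, S_3$ from traces of powers and tests the identity (10) for $S_3(A + B)$ on random integer matrices.

```python
import random


def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def trace(X):
    return sum(X[i][i] for i in range(len(X)))


def newton_girard(A, r):
    """Return [S_0(A), ..., S_r(A)] computed from Tr(A), ..., Tr(A^r)."""
    n = len(A)
    power = [[int(i == j) for j in range(n)] for i in range(n)]  # identity matrix
    traces = [None]  # traces[j] holds Tr(A^j) for j >= 1
    for _ in range(r):
        power = matmul(power, A)
        traces.append(trace(power))
    S = [1]
    for k in range(1, r + 1):
        acc = sum((-1) ** (j - 1) * S[k - j] * traces[j] for j in range(1, k + 1))
        S.append(acc // k)  # k * S_k; the division is exact for integer matrices
    return S


rng = random.Random(2)  # fixed seed, so the run is reproducible
n = 4
A = [[rng.randint(-3, 3) for _ in range(n)] for _ in range(n)]
B = [[rng.randint(-3, 3) for _ in range(n)] for _ in range(n)]

SA, SB = newton_girard(A, 3), newton_girard(B, 3)
S_sum = newton_girard(add(A, B), 3)
AB = matmul(A, B)

rhs = (SA[3] + SB[3] - trace(add(A, B)) * trace(AB)
       + SA[1] * SB[2] + SB[1] * SA[2]
       + trace(matmul(A, AB))    # Tr(A^2 B)
       + trace(matmul(AB, B)))   # Tr(A B^2)
print(S_sum[3] == rhs)
```

Working with traces of powers avoids enumerating principal minors, which is the practical appeal of the Newton-Girard approach for larger $n$.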

4.1 Obtaining $S_4(A + B)$

As in the above examples, the Newton-Girard formula gives

$$\begin{aligned}
-4 S_4(A + B) ={}& S_1((A + B)^4) - S_1((A + B)^3)\, S_1(A + B) \\
&+ S_1((A + B)^2)\, S_2(A + B) - S_1(A + B)\, S_3(A + B).
\end{aligned}$$

Taking into account the properties of the trace, we get

$$\begin{aligned}
-4 S_4(A + B) ={}& S_1(A^4) + 4 S_1(A^3 B) + 4 S_1(A^2 B^2) + 2 S_1((AB)^2) + 4 S_1(A B^3) \\
&+ S_1(B^4) - S_1(A^3) S_1(A) - S_1(A^3) S_1(B) - 3 S_1(A^2 B) S_1(A) \\
&- 3 S_1(A^2 B) S_1(B) - 3 S_1(A B^2) S_1(A) - 3 S_1(A B^2) S_1(B) \\
&- S_1(B^3) S_1(A) - S_1(B^3) S_1(B) \\
&+ \bigl(S_1(A^2) + 2 S_1(AB) + S_1(B^2)\bigr)\bigl(S_2(A) + S_2(B) + S_1(A) S_1(B) - S_1(AB)\bigr) \\
&- S_1(A + B)\bigl(S_3(A) + S_3(B) + S_1(A) S_2(B) + S_1(B) S_2(A) \\
&\qquad + S_1(A^2 B) + S_1(A B^2) - S_1(AB) S_1(A) - S_1(AB) S_1(B)\bigr).
\end{aligned}$$

Applying the same technique as before, we get

$$\begin{aligned}
-4 S_4(A + B) ={}& -4 S_4(A) - 4 S_4(B) \\
&+ 4 S_1(A^3 B) + 4 S_1(A^2 B^2) + 2 S_1((AB)^2) + 4 S_1(A B^3) \\
&- S_1(A^3) S_1(B) - 3 S_1(A^2 B) S_1(A) - 3 S_1(A^2 B) S_1(B) \\
&- 3 S_1(A B^2) S_1(A) - 3 S_1(A B^2) S_1(B) - S_1(B^3) S_1(A) \\
&+ S_1(A^2)\bigl(S_2(B) + S_1(A) S_1(B) - S_1(AB)\bigr) \\
&+ 2 S_1(AB)\bigl(S_2(A) + S_2(B) + S_1(A) S_1(B) - S_1(AB)\bigr) \\
&+ S_1(B^2)\bigl(S_2(A) + S_1(A) S_1(B) - S_1(AB)\bigr) \\
&- S_1(A)\bigl(S_3(B) + S_1(A) S_2(B) + S_1(B) S_2(A) + S_1(A^2 B) + S_1(A B^2) \\
&\qquad - S_1(AB) S_1(A) - S_1(AB) S_1(B)\bigr) \\
&- S_1(B)\bigl(S_3(A) + S_1(A) S_2(B) + S_1(B) S_2(A) + S_1(A^2 B) + S_1(A B^2) \\
&\qquad - S_1(AB) S_1(A) - S_1(AB) S_1(B)\bigr).
\end{aligned}$$
After some simplifications applying the Newton-Girard formulas we get

$$\begin{aligned}
4 S_4(A + B) ={}& 4 S_4(A) + 4 S_4(B) - 4 S_1(A^3 B) - 4 S_1(A^2 B^2) - 2 S_1((AB)^2) - 4 S_1(A B^3) \\
&+ 3 S_1(A^2 B) S_1(A) + 3 S_1(A^2 B) S_1(B) + 3 S_1(A B^2) S_1(A) + 3 S_1(A B^2) S_1(B) \\
&- 2 S_1(AB)\bigl(S_1(A) S_1(B) - S_1(AB)\bigr) \\
&+ S_1(A)\bigl(S_1(A^2 B) + S_1(A B^2) - S_1(AB) S_1(B)\bigr) \\
&+ S_1(B)\bigl(S_1(A^2 B) + S_1(A B^2) - S_1(AB) S_1(A)\bigr) \\
&+ 4 S_3(A) S_1(B) + 4 S_3(B) S_1(A) + 4 S_2(A) S_2(B) \\
&- 4 S_2(A) S_1(AB) - 4 S_2(B) S_1(AB).
\end{aligned}$$
Applying the Newton-Girard formulas once more, and after some simplifications, we get

$$\begin{aligned}
S_4(A + B) ={}& S_4(A) + S_4(B) - S_1(A^3 B) - S_1(A^2 B^2) - S_1(A B^3) \\
&+ S_1(A^2 B) S_1(A) + S_1(A^2 B) S_1(B) + S_1(A B^2) S_1(A) + S_1(A B^2) S_1(B) \\
&- S_1(AB) S_1(A) S_1(B) + S_3(A) S_1(B) + S_3(B) S_1(A) + S_2(A) S_2(B) \\
&- S_2(A) S_1(AB) - S_2(B) S_1(AB) + S_2(AB).
\end{aligned}$$
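As a sanity check, the closing formula for $S_4(A + B)$ can be tested on random integer matrices. The sketch below uses assumed helper names, with `S` standing for the principal-minor definition of the elementary symmetric functions, and compares both sides for a $4 \times 4$ pair, where $S_4$ is the determinant.

```python
import random
from itertools import combinations


def det(M):
    """Determinant by Laplace expansion along the first row (exact for integers)."""
    if not M:
        return 1
    return sum(
        (-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
        for j in range(len(M))
    )


def S(A, k):
    """S_k(A): the sum of all k-by-k principal minors of A."""
    n = len(A)
    return sum(
        det([[A[i][j] for j in idx] for i in idx])
        for idx in combinations(range(n), k)
    )


def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def add(X, Y):
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def trace(X):
    return sum(X[i][i] for i in range(len(X)))


def s4_of_sum(A, B):
    """Right-hand side of the final formula for S_4(A + B)."""
    tA, tB = trace(A), trace(B)
    AB = matmul(A, B)
    A2B = matmul(A, AB)    # A^2 B
    AB2 = matmul(AB, B)    # A B^2
    A3B = matmul(A, A2B)   # A^3 B
    A2B2 = matmul(A2B, B)  # A^2 B^2
    AB3 = matmul(AB2, B)   # A B^3
    return (S(A, 4) + S(B, 4)
            - trace(A3B) - trace(A2B2) - trace(AB3)
            + trace(A2B) * (tA + tB) + trace(AB2) * (tA + tB)
            - trace(AB) * tA * tB
            + S(A, 3) * tB + S(B, 3) * tA + S(A, 2) * S(B, 2)
            - (S(A, 2) + S(B, 2)) * trace(AB)
            + S(AB, 2))


rng = random.Random(4)  # fixed seed, so the run is reproducible
n = 4
A = [[rng.randint(-3, 3) for _ in range(n)] for _ in range(n)]
B = [[rng.randint(-3, 3) for _ in range(n)] for _ in range(n)]
print(s4_of_sum(A, B) == S(add(A, B), 4))
```

The term $S_2(AB)$ absorbs $\tfrac{1}{2}\bigl(S_1(AB)^2 - S_1((AB)^2)\bigr)$, which is the last Newton-Girard substitution made in the derivation.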

5 Conclusions and Outlook 

We have constructed the 2nd, the 3rd and the 4th elementary symmetric functions of a sum of
two matrices. Of course, it is easy to see that it is possible to compute the $r$-th elementary
symmetric function of a sum of $N$ matrices, $1 < N \le r$, by using the Newton-Girard formulas
or by using the same technique used in Lemma 3.2, which, incidentally, is considerably more complicated.

Of course, one of the goals of further papers is to find a closed expression in the general
case, which for the moment is not clear, although we believe the theory of partitions of integers
is involved.

In fact, by using the generalized Waring's formula [6], for any $n$-by-$n$ matrix $A$, $0 \le k \le n$,
and any nonnegative integer $m$, we get

$$S_k(A^m) = (-1)^{k(m+1)} \sum_{|\lambda| = km} A_\lambda\, e_\lambda(A),$$

where the coefficients $A_\lambda$ are given by

A = V V n ( l(Xi) \ 

7r=(fci,fc 2 ,... ) \\i\=kin 

and $e_\lambda(A) = S_1^{m_1(\lambda)}(A)\, S_2^{m_2(\lambda)}(A) \cdots$

Remark 5.1 A partition is a finite sequence $\lambda = (\lambda_1, \lambda_2, \dots, \lambda_r)$ of positive integers in decreasing
order, where $l(\lambda)$ denotes the length of the partition, and $m_k(\lambda)$ denotes the number of parts of
$\lambda$ equal to $k$.

$\lambda \cup \mu$ is the partition whose parts are those of $\lambda$ and $\mu$.

Taking into account this identity, we believe that we can obtain an analogous expression for
the $r$-th elementary symmetric function of a sum of matrices. In fact, we expect an expression
involving the elementary symmetric functions of words in the letters $A_1, A_2, \dots, A_N$,
as one can see in subsection 4.1 for the case $S_4(A_1 + A_2)$.

Acknowledgements: The author thanks the referee for the constructive remarks and the
valuable comments. This work has been supported by Dirección General de Investigación
(Ministerio de Educación y Ciencia) of Spain, grant MTM 2006-13000-C03-02.



A Proof of Theorem 2.2 

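We proceed by induction on $n$:

• If $n = 1$ the matrices are scalars so, for every $k$,

$$\sum_{|\Omega| = k} \det\Bigl(A + \sum_{A_i \in \Omega} A_i\Bigr) = \sum_{|\Omega| = k} \Bigl(A + \sum_{A_i \in \Omega} A_i\Bigr) = \binom{N}{k} A + \binom{N-1}{k-1} (A_1 + A_2 + \cdots + A_N),$$

and hence (4) holds for $n = 1$ and $N \ge 2$.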

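If we assume that the result holds for $n$, let us prove the identity (4) for $n + 1$.
Taking the Laplace expansion along the first row, we get

$$\sum_{k=0}^{N} (-1)^k \sum_{\substack{\Omega \in \Sigma(S) \\ |\Omega| = k}} \det\Bigl(A + \sum_{A_i \in \Omega} A_i\Bigr) = \sum_{k=0}^{N} (-1)^k \sum_{\substack{\Omega \in \Sigma(S) \\ |\Omega| = k}} \Bigl\{ \sum_{j=1}^{n+1} (-1)^{j+1} \Bigl(A(1,j) + \sum_{A_i \in \Omega} A_i(1,j)\Bigr) \det\Bigl(A_{1j} + \sum_{A_i \in \Omega} (A_i)_{1j}\Bigr) \Bigr\}.$$

By induction, since for every $j = 1, 2, \dots, n + 1$ the entry $A(1,j)$ is fixed and does not depend on $k$ or
$\Omega$, we get that the above expression is equal to

$$\begin{aligned}
&\sum_{j=1}^{n+1} (-1)^{j+1} \sum_{k=0}^{N} (-1)^k \sum_{\substack{\Omega \in \Sigma(S) \\ |\Omega| = k}} \Bigl(\sum_{A_i \in \Omega} A_i(1,j)\Bigr) \det\Bigl(A_{1j} + \sum_{A_i \in \Omega} (A_i)_{1j}\Bigr) \\
&\quad = \sum_{j=1}^{n+1} (-1)^{j+1} \sum_{k=0}^{N} (-1)^k \sum_{\lambda=1}^{N} A_\lambda(1,j) \sum_{\substack{\Omega \in \Sigma(S),\ A_\lambda \in \Omega \\ |\Omega| = k}} \det\Bigl(A_{1j} + \sum_{A_i \in \Omega} (A_i)_{1j}\Bigr).
\end{aligned}$$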



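Now, with the convention that a sum over formal sums with a negative number of summands
(the case $k = 0$ below) is zero, we get

$$\begin{aligned}
&\sum_{j=1}^{n+1} (-1)^{j+1} \sum_{k=0}^{N} (-1)^k \sum_{\lambda=1}^{N} A_\lambda(1,j) \sum_{\substack{\Omega \in \Sigma(S \setminus \{A_\lambda\}) \\ |\Omega| = k-1}} \det\Bigl(A_{1j} + (A_\lambda)_{1j} + \sum_{A_i \in \Omega} (A_i)_{1j}\Bigr) \\
&\quad = \sum_{j=1}^{n+1} (-1)^{j+1} \sum_{\lambda=1}^{N} A_\lambda(1,j) \sum_{k=0}^{N-1} (-1)^{k+1} \sum_{\substack{\Omega \in \Sigma(S \setminus \{A_\lambda\}) \\ |\Omega| = k}} \det\Bigl(A_{1j} + (A_\lambda)_{1j} + \sum_{A_i \in \Omega} (A_i)_{1j}\Bigr).
\end{aligned}$$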


By induction, since for every $\lambda$ and $j$ the matrices $A_{1j}, (A_\lambda)_{1j} \in M_n$ are fixed, the matrix
$\widetilde{A} = A_{1j} + (A_\lambda)_{1j} \in M_n$ is also fixed. Thus we get

$$\sum_{j=1}^{n+1} (-1)^{j+1} \sum_{\lambda=1}^{N} A_\lambda(1,j) \cdot 0 = 0.$$

Here we use that, since $N \ge n + 2$, we have $N - 1 \ge n + 1$. Hence, the relation holds.